Value Iteration Networks

نویسندگان

Aviv Tamar

Yi Wu

Garrett Thomas

Sergey Levine

Pieter Abbeel

چکیده

We introduce the value iteration network (VIN): a fully differentiable neural network with a ‘planning module’ embedded within. VINs can learn to plan, and are suitable for predicting outcomes that involve planning-based reasoning, such as policies for reinforcement learning. Key to our approach is a novel differentiable approximation of the value-iteration algorithm, which can be represented as a convolutional neural network, and trained end-to-end using standard backpropagation. We evaluate VIN based policies on discrete and continuous path-planning domains, and on a natural-language based search task. We show that by learning an explicit planning computation, VIN policies generalize better to new, unseen domains.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Soft Value Iteration Networks for Planetary Rover Path Planning

Value iteration networks are an approximation of the value iteration (VI) algorithm implemented with convolutional neural networks to make VI fully differentiable. In this work, we study these networks in the context of robot motion planning, with a focus on applications to planetary rovers. The key challenging task in learningbased motion planning is to learn a transformation from terrain obse...

متن کامل

The variational iteration method for a class of tenth-order boundary value differential equations

متن کامل

Cooperative Motion Planning for Non-Holonomic Agents with Value Iteration Networks

Cooperative motion planning is still a challenging task for robots. Recently, Value Iteration Networks (VINs) were proposed to model motion planning tasks as Neural Networks. In this work, we extend VINs to solve cooperative planning tasks under non-holonomic constraints. For this, we interconnect multiple VINs to pay respect to each other’s outputs. Policies for cooperation are generated via i...

متن کامل

Application of variational iteration method for solving singular two point boundary value problems

In this paper, He's highly prolic variational iteration method is applied ef-fectively for showing the existence, uniqueness and solving a class of singularsecond order two point boundary value problems. The process of nding solu-tion involves generation of a sequence of appropriate and approximate iterativesolution function equally likely to converge to the exact solution of the givenproblem w...

متن کامل

Stochastic Assessment of Voltage Sags in Distribution Networks

This paper compares fault position and Monte Carlo methods as the most common methods in stochastic assessment of voltage sags. To compare their abilities, symmetrical and unsymmetrical faults with different probability distribution of fault positions along the lines are applied in a test system. The voltage sag magnitude in different nodes of test system is calculated. The problem with the...

متن کامل

Variational Iteration Method for Free Vibration Analysis of a Timoshenko Beam under Various Boundary Conditions

In this paper, a relatively new method, namely variational iteration method (VIM), is developed for free vibration analysis of a Timoshenko beam with different boundary conditions. In the VIM, an appropriate Lagrange multiplier is first chosen according to order of the governing differential equation of the boundary value problem, and then an iteration process is used till the desired accuracy ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

Value Iteration Networks

نویسندگان

چکیده

منابع مشابه

Soft Value Iteration Networks for Planetary Rover Path Planning

The variational iteration method for a class of tenth-order boundary value differential equations

Cooperative Motion Planning for Non-Holonomic Agents with Value Iteration Networks

Application of variational iteration method for solving singular two point boundary value problems

Stochastic Assessment of Voltage Sags in Distribution Networks

Variational Iteration Method for Free Vibration Analysis of a Timoshenko Beam under Various Boundary Conditions

عنوان ژورنال:

اشتراک گذاری